Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 590 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 128 |
| Duplicate rows (%) | 21.7% |
| Total size in memory | 64.7 KiB |
| Average record size in memory | 112.2 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 1 |
Warnings
| Dataset has 128 (21.7%) duplicate rows | Duplicates |
|---|---|
| great has 455 (77.1%) zeros | Zeros |
| spiritual has 465 (78.8%) zeros | Zeros |
| mind has 451 (76.4%) zeros | Zeros |
| shall has 446 (75.6%) zeros | Zeros |
| things has 388 (65.8%) zeros | Zeros |
| heart has 448 (75.9%) zeros | Zeros |
| knowledge has 470 (79.7%) zeros | Zeros |
| soul has 459 (77.8%) zeros | Zeros |
| may has 464 (78.6%) zeros | Zeros |
| life has 392 (66.4%) zeros | Zeros |
| men has 471 (79.8%) zeros | Zeros |
| man has 343 (58.1%) zeros | Zeros |
| one has 340 (57.6%) zeros | Zeros |
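The percentages in these warnings appear to be each count divided by the 590 observations and rounded to one decimal. The following is a minimal sketch of that arithmetic, not the profiler's own code; the names `N_OBSERVATIONS`, `zero_counts` and `percent` are illustrative, and the counts are copied from the warnings.

```python
# Zero counts copied from the warnings; 590 is the reported number of observations.
N_OBSERVATIONS = 590
zero_counts = {
    "great": 455, "spiritual": 465, "mind": 451, "shall": 446,
    "things": 388, "heart": 448, "knowledge": 470, "soul": 459,
    "may": 464, "life": 392, "men": 471, "man": 343, "one": 340,
}


def percent(count, total):
    """Share of `count` in `total`, as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)


zero_share = {name: percent(c, N_OBSERVATIONS) for name, c in zero_counts.items()}
duplicate_share = percent(128, N_OBSERVATIONS)  # 128 duplicate rows reported

for name, share in zero_share.items():
    print(f"{name}: {share}% zeros")
print(f"duplicate rows: {duplicate_share}%")
```

Every column is zero in more than half of its rows, which is worth keeping in mind when reading the medians and quartiles reported per variable.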
Reproduction
| Analysis started | 2021-04-30 09:58:21.288390 |
|---|---|
| Analysis finished | 2021-04-30 09:58:47.841236 |
| Duration | 26.55 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
great
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3457627119 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 455 |
| Zeros (%) | 77.1% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7898597657 |
|---|---|
| Coefficient of variation (CV) | 2.284398342 |
| Kurtosis | 17.36315425 |
| Mean | 0.3457627119 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.493188915 |
| Sum | 204 |
| Variance | 0.6238784495 |
| Monotonicity | Not monotonic |
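The figures in this block can be re-derived from the value counts the report lists for the same variable (455 zeros, 92 ones, and so on). The sketch below assumes pandas-style defaults: sample variance with n - 1 in the denominator and linearly interpolated quantiles. The helper `quantile` is illustrative, not part of any library.

```python
import math

# Value -> count, as listed in this variable's value table.
counts = {0: 455, 1: 92, 2: 29, 3: 8, 4: 3, 5: 1, 6: 1, 7: 1}

# Expand the frequency table back into a sorted list of observations.
data = sorted(value for value, count in counts.items() for _ in range(count))
n = len(data)
total = sum(data)
mean = total / n

# Sample variance: squared deviations divided by n - 1 (an assumption that
# reproduces the reported figure).
variance = sum((x - mean) ** 2 for x in data) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation


def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = math.ceil(pos)
    fraction = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


median = quantile(data, 0.5)
iqr = quantile(data, 0.75) - quantile(data, 0.25)
# Median absolute deviation: median distance of each observation from the median.
mad = quantile(sorted(abs(x - median) for x in data), 0.5)

print(f"n={n} sum={total} mean={mean:.10f}")
print(f"variance={variance:.10f} std={std:.10f} cv={cv:.9f}")
print(f"p95={quantile(data, 0.95)} iqr={iqr} mad={mad}")
```

Because more than three quarters of the observations are zero, the median, both quartiles and the MAD all collapse to 0, so the mean and the upper percentiles carry most of the information about this variable.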
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 0 | 455 | 77.1% |
| 1 | 92 | 15.6% |
| 2 | 29 | 4.9% |
| 3 | 8 | 1.4% |
| 4 | 3 | 0.5% |
| 7 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 455 | 77.1% |
| 1 | 92 | 15.6% |
| 2 | 29 | 4.9% |
| 3 | 8 | 1.4% |
| 4 | 3 | 0.5% |
| Value | Count | Frequency (%) |
| 7 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 4 | 3 | 0.5% |
| 3 | 8 | 1.4% |
spiritual
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5237288136 |
|---|---|
| Minimum | 0 |
| Maximum | 15 |
| Zeros | 465 |
| Zeros (%) | 78.8% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.408901965 |
|---|---|
| Coefficient of variation (CV) | 2.690136439 |
| Kurtosis | 28.52126027 |
| Mean | 0.5237288136 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.488912451 |
| Sum | 309 |
| Variance | 1.985004748 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) |
| 0 | 465 | 78.8% |
| 1 | 53 | 9.0% |
| 2 | 31 | 5.3% |
| 3 | 16 | 2.7% |
| 4 | 9 | 1.5% |
| 5 | 8 | 1.4% |
| 8 | 4 | 0.7% |
| 15 | 1 | 0.2% |
| 10 | 1 | 0.2% |
| 7 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 465 | 78.8% |
| 1 | 53 | 9.0% |
| 2 | 31 | 5.3% |
| 3 | 16 | 2.7% |
| 4 | 9 | 1.5% |
| Value | Count | Frequency (%) |
| 15 | 1 | 0.2% |
| 10 | 1 | 0.2% |
| 8 | 4 | 0.7% |
| 7 | 1 | 0.2% |
| 6 | 1 | 0.2% |
mind
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5050847458 |
|---|---|
| Minimum | 0 |
| Maximum | 43 |
| Zeros | 451 |
| Zeros (%) | 76.4% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.023413355 |
|---|---|
| Coefficient of variation (CV) | 4.006086844 |
| Kurtosis | 331.8870089 |
| Mean | 0.5050847458 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.20146962 |
| Sum | 298 |
| Variance | 4.094201606 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) |
| 0 | 451 | 76.4% |
| 1 | 75 | 12.7% |
| 2 | 36 | 6.1% |
| 3 | 14 | 2.4% |
| 4 | 8 | 1.4% |
| 7 | 2 | 0.3% |
| 43 | 1 | 0.2% |
| 9 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 451 | 76.4% |
| 1 | 75 | 12.7% |
| 2 | 36 | 6.1% |
| 3 | 14 | 2.4% |
| 4 | 8 | 1.4% |
| Value | Count | Frequency (%) |
| 43 | 1 | 0.2% |
| 9 | 1 | 0.2% |
| 7 | 2 | 0.3% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
shall
| Distinct | 28 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.979661017 |
|---|---|
| Minimum | 0 |
| Maximum | 32 |
| Zeros | 446 |
| Zeros (%) | 75.6% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 14.55 |
| Maximum | 32 |
| Range | 32 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.981077254 |
|---|---|
| Coefficient of variation (CV) | 2.516126353 |
| Kurtosis | 9.19126522 |
| Mean | 1.979661017 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.010327452 |
| Sum | 1168 |
| Variance | 24.81113061 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=28)
| Value | Count | Frequency (%) |
| 0 | 446 | 75.6% |
| 1 | 35 | 5.9% |
| 2 | 13 | 2.2% |
| 6 | 9 | 1.5% |
| 8 | 8 | 1.4% |
| 9 | 8 | 1.4% |
| 5 | 7 | 1.2% |
| 11 | 6 | 1.0% |
| 3 | 6 | 1.0% |
| 10 | 6 | 1.0% |
| Other values (18) | 46 | 7.8% |
| Value | Count | Frequency (%) |
| 0 | 446 | 75.6% |
| 1 | 35 | 5.9% |
| 2 | 13 | 2.2% |
| 3 | 6 | 1.0% |
| 4 | 5 | 0.8% |
| Value | Count | Frequency (%) |
| 32 | 1 | 0.2% |
| 27 | 1 | 0.2% |
| 26 | 2 | 0.3% |
| 24 | 2 | 0.3% |
| 23 | 1 | 0.2% |
therefore
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.7 KiB |
| Value | Count |
|---|---|
| 0 | 469 |
| 1 | 92 |
| 2 | 20 |
| 3 | 7 |
| 6 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 590 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 590 | 100.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 590 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 590 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 469 | 79.5% |
| 1 | 92 | 15.6% |
| 2 | 20 | 3.4% |
| 3 | 7 | 1.2% |
| 6 | 2 | 0.3% |
things
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.793220339 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 388 |
| Zeros (%) | 65.8% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.611067875 |
|---|---|
| Coefficient of variation (CV) | 2.031047107 |
| Kurtosis | 10.72393293 |
| Mean | 0.793220339 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.010490579 |
| Sum | 468 |
| Variance | 2.595539697 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) |
| 0 | 388 | 65.8% |
| 1 | 107 | 18.1% |
| 2 | 37 | 6.3% |
| 3 | 16 | 2.7% |
| 4 | 15 | 2.5% |
| 6 | 12 | 2.0% |
| 5 | 6 | 1.0% |
| 9 | 4 | 0.7% |
| 7 | 3 | 0.5% |
| 12 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 388 | 65.8% |
| 1 | 107 | 18.1% |
| 2 | 37 | 6.3% |
| 3 | 16 | 2.7% |
| 4 | 15 | 2.5% |
| Value | Count | Frequency (%) |
| 12 | 1 | 0.2% |
| 9 | 4 | 0.7% |
| 8 | 1 | 0.2% |
| 7 | 3 | 0.5% |
| 6 | 12 | 2.0% |
heart
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4610169492 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 448 |
| Zeros (%) | 75.9% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.009802036 |
|---|---|
| Coefficient of variation (CV) | 2.190379417 |
| Kurtosis | 7.278534536 |
| Mean | 0.4610169492 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.652933735 |
| Sum | 272 |
| Variance | 1.019700153 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 0 | 448 | 75.9% |
| 1 | 74 | 12.5% |
| 2 | 31 | 5.3% |
| 3 | 21 | 3.6% |
| 4 | 8 | 1.4% |
| 5 | 7 | 1.2% |
| 6 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 448 | 75.9% |
| 1 | 74 | 12.5% |
| 2 | 31 | 5.3% |
| 3 | 21 | 3.6% |
| 4 | 8 | 1.4% |
| Value | Count | Frequency (%) |
| 6 | 1 | 0.2% |
| 5 | 7 | 1.2% |
| 4 | 8 | 1.4% |
| 3 | 21 | 3.6% |
| 2 | 31 | 5.3% |
knowledge
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3457627119 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 470 |
| Zeros (%) | 79.7% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8479120534 |
|---|---|
| Coefficient of variation (CV) | 2.452294664 |
| Kurtosis | 13.12787846 |
| Mean | 0.3457627119 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.315295881 |
| Sum | 204 |
| Variance | 0.7189548502 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 0 | 470 | 79.7% |
| 1 | 72 | 12.2% |
| 2 | 29 | 4.9% |
| 3 | 8 | 1.4% |
| 4 | 7 | 1.2% |
| 6 | 2 | 0.3% |
| 5 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 470 | 79.7% |
| 1 | 72 | 12.2% |
| 2 | 29 | 4.9% |
| 3 | 8 | 1.4% |
| 4 | 7 | 1.2% |
| Value | Count | Frequency (%) |
| 6 | 2 | 0.3% |
| 5 | 2 | 0.3% |
| 4 | 7 | 1.2% |
| 3 | 8 | 1.4% |
| 2 | 29 | 4.9% |
soul
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4423728814 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 459 |
| Zeros (%) | 77.8% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.55 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.046897493 |
|---|---|
| Coefficient of variation (CV) | 2.366549887 |
| Kurtosis | 13.79084407 |
| Mean | 0.4423728814 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.297899993 |
| Sum | 261 |
| Variance | 1.09599436 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 0 | 459 | 77.8% |
| 1 | 62 | 10.5% |
| 2 | 39 | 6.6% |
| 3 | 14 | 2.4% |
| 4 | 8 | 1.4% |
| 5 | 5 | 0.8% |
| 8 | 2 | 0.3% |
| 6 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 459 | 77.8% |
| 1 | 62 | 10.5% |
| 2 | 39 | 6.6% |
| 3 | 14 | 2.4% |
| 4 | 8 | 1.4% |
| Value | Count | Frequency (%) |
| 8 | 2 | 0.3% |
| 6 | 1 | 0.2% |
| 5 | 5 | 0.8% |
| 4 | 8 | 1.4% |
| 3 | 14 | 2.4% |
may
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3288135593 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 464 |
| Zeros (%) | 78.6% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7970767575 |
|---|---|
| Coefficient of variation (CV) | 2.424099417 |
| Kurtosis | 21.0957728 |
| Mean | 0.3288135593 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.800972248 |
| Sum | 194 |
| Variance | 0.6353313574 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 0 | 464 | 78.6% |
| 1 | 88 | 14.9% |
| 2 | 21 | 3.6% |
| 3 | 9 | 1.5% |
| 4 | 6 | 1.0% |
| 8 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 464 | 78.6% |
| 1 | 88 | 14.9% |
| 2 | 21 | 3.6% |
| 3 | 9 | 1.5% |
| 4 | 6 | 1.0% |
| Value | Count | Frequency (%) |
| 8 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 4 | 6 | 1.0% |
| 3 | 9 | 1.5% |
| 2 | 21 | 3.6% |
life
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6050847458 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 392 |
| Zeros (%) | 66.4% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.054527587 |
|---|---|
| Coefficient of variation (CV) | 1.742776684 |
| Kurtosis | 3.939663748 |
| Mean | 0.6050847458 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.018322398 |
| Sum | 357 |
| Variance | 1.112028431 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 0 | 392 | 66.4% |
| 1 | 107 | 18.1% |
| 2 | 47 | 8.0% |
| 3 | 25 | 4.2% |
| 4 | 15 | 2.5% |
| 5 | 3 | 0.5% |
| 6 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 392 | 66.4% |
| 1 | 107 | 18.1% |
| 2 | 47 | 8.0% |
| 3 | 25 | 4.2% |
| 4 | 15 | 2.5% |
| Value | Count | Frequency (%) |
| 6 | 1 | 0.2% |
| 5 | 3 | 0.5% |
| 4 | 15 | 2.5% |
| 3 | 25 | 4.2% |
| 2 | 47 | 8.0% |
men
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3881355932 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 471 |
| Zeros (%) | 79.8% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.55 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.9924305616 |
|---|---|
| Coefficient of variation (CV) | 2.556917167 |
| Kurtosis | 13.68518559 |
| Mean | 0.3881355932 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.459565272 |
| Sum | 229 |
| Variance | 0.9849184196 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 0 | 471 | 79.8% |
| 1 | 68 | 11.5% |
| 2 | 21 | 3.6% |
| 3 | 16 | 2.7% |
| 4 | 5 | 0.8% |
| 6 | 4 | 0.7% |
| 5 | 4 | 0.7% |
| 7 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 471 | 79.8% |
| 1 | 68 | 11.5% |
| 2 | 21 | 3.6% |
| 3 | 16 | 2.7% |
| 4 | 5 | 0.8% |
| Value | Count | Frequency (%) |
| 7 | 1 | 0.2% |
| 6 | 4 | 0.7% |
| 5 | 4 | 0.7% |
| 4 | 5 | 0.8% |
| 3 | 16 | 2.7% |
man
| Distinct | 15 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.433898305 |
|---|---|
| Minimum | 0 |
| Maximum | 14 |
| Zeros | 343 |
| Zeros (%) | 58.1% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 8 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.617987289 |
|---|---|
| Coefficient of variation (CV) | 1.825783097 |
| Kurtosis | 5.906129775 |
| Mean | 1.433898305 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.441990711 |
| Sum | 846 |
| Variance | 6.853857443 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) |
| 0 | 343 | 58.1% |
| 1 | 88 | 14.9% |
| 2 | 51 | 8.6% |
| 4 | 25 | 4.2% |
| 3 | 25 | 4.2% |
| 7 | 13 | 2.2% |
| 8 | 10 | 1.7% |
| 5 | 9 | 1.5% |
| 10 | 6 | 1.0% |
| 12 | 4 | 0.7% |
| Other values (5) | 16 | 2.7% |
| Value | Count | Frequency (%) |
| 0 | 343 | 58.1% |
| 1 | 88 | 14.9% |
| 2 | 51 | 8.6% |
| 3 | 25 | 4.2% |
| 4 | 25 | 4.2% |
| Value | Count | Frequency (%) |
| 14 | 1 | 0.2% |
| 13 | 3 | 0.5% |
| 12 | 4 | 0.7% |
| 11 | 4 | 0.7% |
| 10 | 6 | 1.0% |
one
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8016949153 |
|---|---|
| Minimum | 0 |
| Maximum | 14 |
| Zeros | 340 |
| Zeros (%) | 57.6% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.394750226 |
|---|---|
| Coefficient of variation (CV) | 1.739751867 |
| Kurtosis | 19.74725646 |
| Mean | 0.8016949153 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.499281644 |
| Sum | 473 |
| Variance | 1.945328192 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) |
| 0 | 340 | 57.6% |
| 1 | 141 | 23.9% |
| 2 | 63 | 10.7% |
| 3 | 20 | 3.4% |
| 4 | 12 | 2.0% |
| 6 | 6 | 1.0% |
| 8 | 3 | 0.5% |
| 5 | 3 | 0.5% |
| 14 | 1 | 0.2% |
| 9 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 340 | 57.6% |
| 1 | 141 | 23.9% |
| 2 | 63 | 10.7% |
| 3 | 20 | 3.4% |
| 4 | 12 | 2.0% |
| Value | Count | Frequency (%) |
| 14 | 1 | 0.2% |
| 9 | 1 | 0.2% |
| 8 | 3 | 0.5% |
| 6 | 6 | 1.0% |
| 5 | 3 | 0.5% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
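The definition can be sketched in a few lines of plain Python. This is an illustration of the formula, not the code the profiler runs; the function name `pearson_r` and the sample data are made up for the example.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1 / (n - 1) factors of the covariance and of the two standard
    # deviations cancel, so plain sums of products are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / (spread_x * spread_y)


x = [0, 1, 2, 3, 4]
y = [1, 0, 3, 2, 5]
r = pearson_r(x, y)

# Shifting and rescaling either variable (with a positive factor) leaves r unchanged.
r_transformed = pearson_r([3 * a + 7 for a in x], [0.5 * b - 2 for b in y])
print(round(r, 4), round(r_transformed, 4))
```

Any exactly linear relationship gives r = 1 or r = -1 regardless of its slope, which is the sense in which the angle to the x-axis does not matter.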
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
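As a rough sketch of that recipe, the code below ranks both variables and then applies Pearson's formula to the ranks. Giving tied values the average of their positions is an assumption (a common convention); `average_ranks`, `pearson_r` and `spearman_rho` are illustrative names.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but not linear
print(round(spearman_rho(x, y), 4), round(pearson_r(x, y), 4))
```

On the cubic example ρ reaches 1 while r stays below 1, which illustrates why ρ is the better choice for nonlinear monotonic relationships.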
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
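The pair-counting definition translates directly into code. The sketch below implements exactly that formula, often called tau-a, under the assumption that tied pairs count as neither concordant nor discordant; `kendall_tau_a` is an illustrative name. Statistical libraries commonly default to the tie-adjusted tau-b variant instead, so values for data with many ties (such as the zero-heavy columns here) may differ from this sketch.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # A product of zero is a tie and counts towards neither group.
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one of the six pairs is out of order
tau = kendall_tau_a(x, y)
print(round(tau, 4))
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = 4/6.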
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation on φk is available.
[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]
First rows
| | great | spiritual | mind | shall | therefore | things | heart | knowledge | soul | may | life | men | man | one |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 2 |
| 1 | 0 | 0 | 0 | 0 | 1 | 3 | 0 | 3 | 0 | 0 | 2 | 0 | 0 | 2 |
| 2 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
| 3 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 1 | 0 | 0 | 3 |
| 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
Last rows
| | great | spiritual | mind | shall | therefore | things | heart | knowledge | soul | may | life | men | man | one |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 580 | 1 | 0 | 0 | 0 | 1 | 4 | 0 | 1 | 2 | 0 | 0 | 1 | 2 | 1 |
| 581 | 1 | 0 | 0 | 1 | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 |
| 582 | 2 | 0 | 0 | 5 | 3 | 6 | 0 | 0 | 0 | 3 | 1 | 4 | 1 | 2 |
| 583 | 1 | 0 | 0 | 0 | 0 | 4 | 0 | 1 | 0 | 1 | 3 | 3 | 1 | 0 |
| 584 | 2 | 0 | 1 | 5 | 3 | 3 | 0 | 1 | 0 | 0 | 4 | 6 | 3 | 1 |
| 585 | 0 | 0 | 0 | 2 | 0 | 5 | 1 | 0 | 1 | 0 | 5 | 1 | 4 | 0 |
| 586 | 0 | 0 | 0 | 4 | 1 | 6 | 0 | 0 | 1 | 0 | 2 | 1 | 3 | 1 |
| 587 | 2 | 0 | 0 | 0 | 1 | 3 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
| 588 | 1 | 0 | 0 | 0 | 1 | 4 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 6 |
| 589 | 2 | 0 | 0 | 0 | 0 | 6 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 |
Most frequent
| | great | spiritual | mind | shall | therefore | things | heart | knowledge | soul | may | life | men | man | one | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 11 |
| 23 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| 18 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| 28 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 |
| 11 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 4 |
| 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 4 |
| 16 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 4 |